RANDOM WALK TERM WEIGHTING FOR IMPROVED TEXT CLASSIFICATION

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Imbalanced text classification: A term weighting approach

The natural distribution of textual data used in text classification is often imbalanced. Categories with fewer examples are under-represented and their classifiers often perform far below satisfactory. We tackle this problem using a simple probability based term weighting scheme to better distinguish documents in minor categories. This new scheme directly utilizes two critical information rati...

متن کامل

An Improved Feature Weighting Method for Text Classification

Feature extraction is the important prerequisite of classifying text effectively and automatically. TF· IDF is widely used to express the text feature weight. But it has some problems. TF•IDF can’t reflect the distribution of terms in the text, and then can’t reflect the importance degree and the difference between categories. This paper proposes a new feature weighting method—TF•IDF•Ci to whic...

متن کامل

Term-Weighting Learning via Genetic Programming for Text Classification

This paper describes a novel approach to learning term-weighting schemes (TWSs) in the context of text classification. In text mining a TWS determines the way in which documents will be represented in a vector space model, before applying a classifier. Whereas acceptable performance has been obtained with standard TWSs (e.g., Boolean and term-frequency schemes), the definition of TWSs has been ...

متن کامل

Biomedical Text Classification with Improved Feature Weighting Method

In bioinformatics, we are interested in new techniques and advances in classification of biomedical documents for the hope of extracting useful biomedical knowledge out of the classification task. In this paper we introduce a feature weighting method for improving biomedical text classification. The method is effective in inducing weighted features from text data for classification. The weight ...

متن کامل

Text Classification by PNN-based Term Re-weighting

Current approaches to feature selection for text classification aim to reduce the number of terms that are used to describe documents. Thus, documents can be classified and found with greater ease and precision. A key shortcoming of these approaches is that they select the topmost terms to describe documents after ranking all terms using a feature selection measure (scoring function). Lesser hi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Semantic Computing

سال: 2007

ISSN: 1793-351X,1793-7108

DOI: 10.1142/s1793351x07000263